Tag

#model optimization

13 articles

Meta's Muse Spark 1.1 outperforms GLM-5.2 in coding and costs slightly less

This article explains how Meta's Muse Spark 1.1 outperforms GLM-5.2 in coding and cost-efficiency, focusing on advancements in hallucination reduction and model reliability.

Jul 1037

SpaceXAI Releases Grok 4.5, a Cursor-Trained Model for Coding, Agentic Tasks, and Knowledge Work at $2/M Input

This explainer explores SpaceXAI's Grok 4.5, a Cursor-trained model optimized for coding, agentic tasks, and knowledge work, examining its advanced architecture, training methodologies, and implications for AI deployment.

Jul 835

Deepseek's DSpark boosts AI speed by up to 85 percent, a strategic win under tightening US export controls

Learn how to implement a simplified version of Deepseek's DSpark technique that boosts AI performance by using a small model to generate candidates and a larger model for validation.

Jun 2939

In the Weights is your new AI-centric vanity search

Learn about the In the Weights score, a novel AI evaluation metric that analyzes neural network parameters to predict model performance and optimize training.

Jun 2041

tech

Inside Automat-it’s playbook for scaling AI startups on AWS

This article explains how AI startups scale their infrastructure on AWS, covering GPU computing, model optimization, and orchestration techniques.

Jun 1039

Researchers pinpoint why larger language models pick up skills that small ones miss

Learn how to improve model performance on rare tasks by adjusting training data frequency, using practical Python examples.

Jun 631

Hexo Labs Open-Sources SIA: A Self-Improving Agent That Updates Both the Harness and the Model Weights

Learn how to create a basic Self-Improving Agent (SIA) that can update both its problem-solving framework and model weights, inspired by Hexo Labs' open-source SIA system.

May 2855

Qwen3.6-27B beats much larger predecessor on most coding benchmarks

This article explains how Alibaba's Qwen3.6-27B model outperforms its much larger predecessor on coding benchmarks, highlighting advancements in parameter efficiency and model optimization techniques.

Apr 25104

Xiaomi Releases MiMo-V2.5-Pro and MiMo-V2.5: Matching Frontier Model Benchmarks at Significantly Lower Token Cost

This article explains how Xiaomi's MiMo-V2.5 models achieve frontier-level AI performance with significantly lower token costs, focusing on agentic AI, token efficiency, and advanced optimization techniques.

Apr 2280

tech

Your old iPad or Android tablet can be your new smart home panel - here's how

Learn how advanced AI optimization techniques enable repurposing old tablets as smart home control panels through edge computing and model compression.

Apr 1797

Step by Step Guide to Build an End-to-End Model Optimization Pipeline with NVIDIA Model Optimizer Using FastNAS Pruning and Fine-Tuning

Learn to build a complete model optimization pipeline using NVIDIA Model Optimizer with FastNAS pruning and fine-tuning techniques. This beginner-friendly tutorial walks you through training, pruning, and fine-tuning a ResNet model on CIFAR-10 dataset.

Apr 296

Liquid AI Released LFM2.5-350M: A Compact 350M Parameter Model Trained on 28T Tokens with Scaled Reinforcement Learning

Learn how to work with compact language models like Liquid AI's LFM2.5-350M by setting up environments, loading models, performing inference, and understanding reinforcement learning integration.

Mar 3188